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For some time it had been suspected 
that values assigned to NIST worldng 
standards of mass were some 0. 17 mg/ 
kg larger than mass values based on 
artifacts representing mass in the Inter- 
national System of Units (SI). This rela- 
tively small offset, now confirmed, has 
had minimal scientific or technological 
significance. The discrepancy was re- 
moved on January 1, 1990. We docu- 
ment the history of the discrepancy, the 
studies which allow its removal, and the 
methods in place to limit its effect and 
prevent its recurrence. For routine cali- 
brations, we believe that our working 
standards now have a long-term stability 
of 0.033 mg/kg (3cr) with respect to the 
national prototype kilograms of the 



United States. We provisionally admit 
an additional uncertainty of 0.09 mg/kg 
(3cr), systematic to all NIST mass mea- 
surements, which represents the possible 
offset of our primary standards from 
standards maintained by the Bureau In- 
ternational des Poids et Mesures 
(BIPM). This systematic uncertainty 
may be significantly reduced after analy- 
sis of results from the 3rd verification of 
national prototype kilograms, which is 
now underway. 

Key words: calibration; international 
standards; kilogram; mass; national stan- 
dards; SI; standards. 

Accepted: January 6, 1990 



1. Introduction 

Tlie kilogram (kg) is one of the seven base units 
which form the foundation of the Systeme Interna- 
tional d'Unites or International System of Units, 
abbreviated SI. Used world wide to express the re- 
sults of physical measurements, the SI specifies that 
the kilogram is the unit of mass and that the mass of 
the International Prototype Kilogram exactly 
equals 1 kg. The International Prototype referred 
to in the definition is a cylinder made of an alloy of 
platinum and iridium and stored at the Interna- 
tional Bureau of Weights and Measures (BIPM) in 
France. The kilogram is thus the only remaining 
base unit of the SI to rely on an artifact for its 
definition. 

When the SI was established, replicas of the In- 
ternational Prototype were manufactured by the 



BIPM for use as national prototype kilograms. At 
long intervals, the national prototypes are returned 
to the BIPM where their assigned mass is verified 
by measurements directly traceable to the Interna- 
tional Prototype [1]. It was intended by the 
founders of the SI that the national prototype kilo- 
grams would be the primary mass standards within 
each country. There are, however, several practi- 
cal difficulties with this scheme. The following dis- 
cusses the reasons for these difficulties and the 
steps we have taken to overcome them. 

In order for the kilogram unit to be useful, meth- 
ods must exist to measure multiples and submulti- 
ples of 1-kg standards. These methods, when 
successful, rely on good equipment and sound ex- 
perimental practice. In addition to these, a calibra- 
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tion service requires rigorous tests to maintain 
statistical control of the measurement process. At 
NIST, statistical rigor was introduced in the 1960s 
through the pioneering work of Pontius and 
Cameron [2]. Present methods are simply refine- 
ments of the system which they established. 

The uncertainty of a 1-kg standard, expressed as 
a dimensionless ratio, propagates directly to mass 
values of multiples and submultiples derived from 
the standard. For example, if a kilogram standard 
has a relative uncertainty of 1 ppm', all multiples 
and submultiples derived from the standard will 
have an uncertainty component of 1 ppm propa- 
gated from the standard. In the field of precision 
measurement, uncertainty is usually reported at an 
estimated level of 1 standard deviation. All uncer- 
tainties are combined by the root-sum-square 
(RSS) method according to guidelines recom- 
mended by the International Committee for 
Weights and Measures (CIPM) [3]. In NIST cali- 
bration reports, on the other hand, uncertainties are 
estimated at a level of 3 standard deviations. Fur- 
thermore, any uncertainty deemed "systematic" to 
a series of measurements is added directly to the 
"random" uncertainties, which are combined by 
RSS. However, in the rest of this paper, we follow 
the CIPM recommendations unless otherwise 
noted. 

In addition to the SI, the United States recog- 
nizes the U.S. Customary System of units for legal 
metrology. In this system, the avoirdupois pound 
(lb) is the unit of mass. It is, by definition, exactly 
equal to 0.45359237 kg. 



2. History of NIST Mass Standards 

Before 1980 
2.1 Primary Mass Standards of Platinum-Iridium 

Kilograms K20 and K4 are the two national pro- 
totypes of the United States. Kilogram K20 has his- 
torically been considered the primary U.S. 
kilogram standard with K4 being relegated to use 
as a "check standard." The history of these two 
artifacts through 1985 has already been docu- 
mented in a previous report [1]. One important 
question which remained open in [1] is whether the 
mass values assigned by BIPM to their working 
standards have been consistent with the SI defini- 
tion of mass. The cause for concern was that the 
embodiment of the SI definition, the International 
Prototype Kilogram, had not been used since 1946. 
This situation has changed within the past year as 

' 1 ppm= I part per million= 1 X I0~^ 



BIPM embarked on only the third calibration of 
national prototype kilograms since 1889. Prelimi- 
nary results obtained by BIPM as a part of the 3rd 
verification confirm the long-term stability of their 
working standards to within required limits [4]. 

2.2 Secondary Mass Standards 

Platinum-iridium alloy (approximate density 
21,500 kg-m~') is too expensive a material for wide- 
spread use. At present, stable alloys of non-mag- 
netic stainless steel (approximate density 8,000 
kg-m~') are usually specified for use as secondary 
standards. Before such alloys were available, prac- 
tical standards were typically made of plated brass 
(approximate density 8,400 kg-m~'). The densities 
of these alloys assume importance because mass 
metrology is almost always performed in the ambi- 
ent air (density ca. 1.2 kg-m"') using balances 
which are, in essence, force or torque transducers. 
The effect of air buoyancy thus becomes a con- 
founding influence which must be removed by cor- 
rection. 

The size of the necessary buoyancy correction 
relative to the mass of interest is given by: 

(1 -Pa/ps)/(l -pyp.)- 1 ~Pa(l/p. - 1/Ps)> (1) 

where pa= ambient air density 

Ps= density of the known standard 
P;t= density of the unknown secondary 
standard. 

Equation (1) makes clear that, when comparing 
weights of nearly equal density, the importance of 
the correction is relatively small. Buoyancy correc- 
tions are typically 10 ppm between alloys of stain- 
less steel and brass; corrections of less than 5 ppm 
are typical for comparisons between various alloys 
of non-magnetic stainless steel. (Specifications for 
the highest quality analytical weights limit the al- 
loy density to within a narrow range in order to 
ensure that buoyancy corrections between nomi- 
nally equal weights will be small.) 

By contrast, the buoyancy correction between (i) 
primary standards of platinum-iridium alloy and (ii) 
secondary standards of brass or stainless steel typi- 
cally ranges from 87-97 ppm. In our laboratory, 
the densities of secondary kilogram standards are 
determined by hydrostatic weighing. The density 
of ambient air is now determined from the CIPM- 
1981 equation-of-state for moist air [5]. The latter 
requires knowledge of ambient temperature, baro- 
metric pressure, relative humidity, and carbon- 
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dioxide level. A discussion of the accuracy which 
can be expected from buoyancy corrections in our 
laboratory is given in [1]. 

The above considerations dictate that calibra- 
tions carried out by NIST on a routine basis be 
performed with secondary standards having a den- 
sity near to that of the unknown weight. 

2,2.1 Ni and N2 Two weights, designated Ni 
and N2, have served as NIST secondary standards 
of mass since 1965. The weights were fabricated in 
1948 of a nickel-chromium alloy having a nominal 
density of 8,340 kg-m"^ which is close to that of 
the brass weights which were then in common use. 
These weights were given an initial calibration in 
terms of a platinum-iridium prototype (K4) in 1948. 
They were recalibrated against both K20 and K4 in 
1958. The newer calibration gave mass values 
which were systematically higher by about 0.06 
mgAg. There is no indication in the existing 
records what, if any, uncertainty was assigned to 
either calibration. When, in 1965, Ni and N2 were 
placed in service as secondary mass standards, the 
mass assigned to them was based on selected data 
from the 1958 series of measurements. Presumably, 
this decision was made because the 1958 measure- 
ments were performed by remote control on a two- 
pan, Rueprecht balance having a standard 
deviation below 0.02 mg. By 1965, this device had 
been replaced by a single-pan balance which was 
much more convenient to use but which had an 
inferior standard deviation of about 0. 1 5 mg. Fur- 
ther, remote weighing was not possible on the sin- 
gle-pan balance. 

Based on the 1958 measurements, the mass of Ni 
and N2 taken together was calculated to be: 

i?=2kg-10.059mg. 

The difference in mass between Ni and N2 was cal- 
culated by pooling a large amount of data: 

C = - 19.476 mg. 

These two numbers, R and C, fix the individual 
values of each kilogram. The uncertainty in C is 
largely statistical in nature. It depends almost en- 
tirely on the standard deviation of the balance used 
to compare the mass of Ni with N^. Thus its uncer- 
tainty could be rigorously assigned. In addition, the 
significance of any measured change in C could 
also be determined. 

The uncertainty of R was much more problem- 
atic. The statistical component of this uncertainty 



resulting from the balance used in the measure- 
ments may, of course, be calculated. There are at 
least two additional components which increase 
the imcertainty of R (but not of C): 

1 . The uncertainty in the accepted mass of K20 
with respect to the International Prototype 
Kilogram. 

2. The accuracy of the correction for air buoy- 
ancy between the platinum-iridium and the 
nichrome kilograms. 

Rather than base an estimate of these uncertainties 
on what was considered insufficient metrological 
data, calibration reports prior to January 1, 1990 
state: 

"It is assumed that the present 'accepted values' of 
the two NIST standards at the 1 kilogram level, 
designated Ni and N2, are without error. Estimates 
of the uncertainty of the accepted values of the 
NIST standards relative to the International Proto- 
type Kilogram can be provided on request. How- 
ever, these estimates have no real meaning in either 
national or international comparison. This is be- 
cause of the lack of sufficient data to provide a 
realistic estimate of the uncertainty in the values 
assigned to the prototype kilograms K20 and K4, 
particularly in regard to long term, or between-run 
variability. Changes in the accepted values for the 
NIST standards at the kilogram level, as and when 
they occur, will be reported in the scientific papers 
of the Bureau and will be given wide distribu- 
tion..." 

Except for the change in name of the institution, 
the above wording had been in place at least since 
1967. The reports of that time (and well beyond) 
also referenced a technical note entitled "The Ac- 
cepted Values of the NBS Standards at the 1 kg 
Level and Associated Uncertainty Estimates," to 
be published at a future date. Unfortunately, this 
note was never produced. Section 3 of the present 
paper may therefore be regarded as fulfilling a 
promise of long standing. 

In looking over calibration documentation ex- 
tending back 25 years, it seems that the original 
intention was to reserve Ni and N2 for calibration 
of other working standards of similar density. 
These working standards would be used in routine 
calibration work and thereby would spare Ni and 
N2 from excessive wear. But the calibration of 
working standards of 1 kg could only be done on 
the single-pan balance mentioned above. Thus 
working standards would be assigned an uncer- 
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tainty which was large relative to the precision of 
commercially available balances unless the calibra- 
tion were based on the average of many measure- 
ments. But the latter strategy would no longer 
spare Ni and N2 from excessive use. 

Faced with this problem, Ni and N2 began to be 
used as working standards themselves in routine 
calibrations. They were never cleaned (except for 
gentle dusting with a brush) in order to prevent 
discontinuous changes in their mass. It was, of 
course, recognized that checks must be established 
to ensure the constancy of the mass assigned to the 
summation of Ni and N2. Two criteria were rou- 
tinely used. 

The first criterion was the constancy of C. A 
measurement of C was available each time N| and 
N2 were used. In time, a newer balance of similar 
design was obtained. This device, which is still in 
use, has a standard deviation of about 0.035 mg. If 
values of C were seen to change significantly with 
time, it would mean that the summation mass of Ni 
and N2 had deviated from its accepted value. This 
test is effective in checking whether one or the 
other kilograms has suffered damage since its last 
use. However, the test fails to detect changes com- 
mon to both artifacts. Because N, and N2 are virtu- 
ally identical and receive identical use, such 
changes cannot be ruled out a priori. Thus the con- 
stancy of C is not a sufficient test to rule out a 
change in the summation mass of the two kilo- 
grams. A control chart showing values of C over 
time is given in figure 1. The second criterion is 
discussed below in section 2.2.2. 
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Figure 1. Mass values of Ni — N2 as a function of time. Measure- 
ments were taken on a balance having a standard deviation of 
0.035 mg for a single reading. 



In 1969, the masses of Ni and of N2 were redeter- 
mined 10 times with respect to K20 and K4. Mea- 
surements were made on a one-pan balance having 
a standard deviation of 0.14 mg for a single obser- 
vation. The results of these measurements indicated 
that Ni and N2 were an average of 0.09 mg/kg be- 
low their accepted value. However, because the 
uncertainties propagated from the prototype kilo- 
grams and from the correction for air buoyancy 
could still not be assessed, these data were not 
used. 

2.2.2 100-g Check Standards The second 
criterion used to monitor the constancy in mass of 
N) and N2 was the evolution in time of two 100-g 
"check" standards. A measurement of one or the 
other of these standards in terms of Ni and N2 was 
obtained each time a routine calibration was per- 
formed on a set of weights from 1 kg to 100 g. Such 
measurements are carried out dozens of times each 
year. If the mass of the 100-g check standards was 
seen to change over time, it would be evidence that 
either their mass or that of Ni and N2 was chang- 
ing. It is unlikely that the mass of the 100-g check 
standards would change in exact proportion to the 
mass of the 1-kg working standards. This test suf- 
fers, however, from low precision. The statistical 
precision in the assignment of mass to a 100-g stan- 
dard is about ten-times lower than the relative pre- 
cision of mass assigned to 1-kg weights. The reason 
is simply that all mass comparisons between 1 kg 
and 100 g are performed on the same balance. One 
would need to average about 10^ mass determina- 
tions of a 100-g check standard in order to have the 
same relative precision as one single mass determi- 
nation of a 1-kg standard. 

The 100-g check standard suffers from an addi- 
tional problem. Since it receives heavy use, its mass 
can reasonably be expected to decrease with time 
due to wear. Control charts showing mass values 
obtained over time for our 100-g check standards, 
JMC-1 and JMC-2, are given in figure 2. The ap- 
parent rapid loss in mass early in the service life of 
JMC-1 is not unusual. Such behavior is also seen, 
for instance, in our 1-g check standard where there 
can be no possibility that the source is instability in 
1-kg working standards. Thus the 100-g check stan- 
dards, while essential to guard against measure- 
ment blunders and catastrophic changes in working 
standards, are themselves susceptible to long-term 
instability. 
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Figure 2. Mass values of 100-g check standards JMC-l(a) and 
JMC-2(b). These values are based on the accepted mass of Ni 
and N2 prior to January 1, 1990. 

2.2.3 State Laboratories Each state within the 
United States maintains a well-equipped laboratory 
for primary mass metrology, typically placed ad- 
ministratively within the State Department of 
Agriculture. Training of personnel and many as- 
pects of quality control are coordinated through 
the NIST Office of Weights and Measures (OWM). 
The OWM organizes regional round-robin mea- 
surements involving State mass-standards of vari- 
ous nominal values. These round-robins also 
include standards recently calibrated by NIST. An 
examination of round-robin results for 1-kg masses 
does not reveal systematic differences between 
NIST and the States developing over time. But the 
precision of these comparisons limits conclusions 
to about 0.5 mgAg. 



2.3 Fundamental Measurements 

Some fundamental constants offer a check on the 
constancy of mass standards. During the 1970s, 
measurements of the Avogadro constant A'a [6] 
and the Faraday constant F relied directly on mass 
values maintained at NIST. These measurements 
can be compared with related measurements at 
other laboratories as is done during periodic CO- 
DATA adjustments of the fundamental constants 
[8]. 

In the case of the NIST determination of the 
Faraday constant, routine mass calibrations of a 5-g 
and 3-g working standard were used. It was esti- 
mated that the uncertainty in these calibrations was 
0.5 ppm (standard error). This estimate contributed 
less than 10 percent of the combined experimental 
uncertainty. The Faraday constant has, therefore, 
little bearing on the present discussion. 

This is not true in the case of the Avogadro con- 
stant. In order to have their mass values directly 
traceable to national standards, the experimenters 
made direct use of K20 and K4. Several calibra- 
tions at the 1-kg level were carried out on the 
newly developed NBS-2 balance [9]. This balance 
operates under remote control and, at that time, 
had a standard deviation of less than 0.005 mg. (Af- 
ter initial testing at NBS, the balance was trans- 
ferred to the BIPM where improved conditions 
have reduced its standard deviation five-fold.) Un- 
fortunately, Ni and N2 were not measured during 
the experiments, although several stainless-steel 
kilograms were calibrated in terms of K20 and K4. 
Two of these kilograms had also been measured 
against K20 and K4 in 1969 as part of the series of 
mass determinations which included Ni and N2 (see 
sec. 2.2.1, above). These results were completely 
consistent with the 1969 measurements and thus 
raise the question of whether the mass values for 
Ni and N2 dating from 1958 were still appropriate. 



3. History of Mass Standards after 1980 

About 10 years ago, NIST began a program to 
tie the mass values disseminated by its calibration 
services with international standards., It was fore- 
seen that improvements in commercial balance 
technology and improved precision in measuring 
critical fundamental constants would soon make 
this step necessary. In addition, questions of inter- 
national compatibility of national standards began 



83 



Volume 95, Number 1, January-February 1990 

Journal of Research of the National Institute of Standards and Technology 



to be raised at this time. In order to assess the 
presently accepted values of NIST secondary stan- 
dards with respect to the SI, four major areas had 
to be addressed: 

1 . A meaningful calibration of K20 and K4 with 
respect to accepted representations of SI stan- 
dards. 

2. A reliable method for making corrections for 
air buoyancy between primary standards of 
platinum-iridium and secondary standards of 
nichrome or stainless steel. 

3. A balance which could compare kilogram 
masses with a precision no worse than 0.005 
mg. 

4. Demonstration that primary standards could 
indeed be used periodically to calibrate sec- 
ondary standards and that mass values so de- 
termined did not suffer from serious, unex- 
plainable discontinuities. 

We now briefly describe efforts made in these 
four areas. 

3.1 Tie to International Standards 

As mentioned in section 2.2.1 above, the main 
reason given in the past for not basing mass calibra- 
tions on routine comparisons with K20 was that the 
long-term stability of platinum-iridium prototype 
kilograms had not been rigorously established. One 
reason for this apparent lack of understanding is 
the infrequency with which the International Pro- 
totype Kilogram is used. The BIPM faces this same 
problem because it is their job to recertify national 
prototype kilograms upon request and to provide 
new national prototype kilograms when required. 
These activities must be carried out during the long 
intervals when the International Prototype Kilo- 
gram is not accessible. 

As described in [1], the BIPM has set in place the 
following system in which all the mass standards 
involved are made of platinum-iridium alloy. 

Two working standards are used in the calibra- 
tion of an unknown prototype. The measured dif- 
ference in mass between the two working 
standards is used to check that neither has suffered 
a catastrophic change in mass. The working stan- 
dards are cleaned at about 15-year intervals. Within 
these intervals, however, their mass is redeter- 
mined periodically against a third kilogram which 
is reserved for just this use. This third kilogram is 
cleaned just prior to its use in recalibrating the 
working standards. Based on the history of the last 
40 years, it appears that the BIPM represen- 



tation of the SI unit of mass is stable to within 
about 0.02 mg (0.02 ppm). Therefore, it seems a 
reasonable goal to achieve compatibility with the 
mass representation currently maintained at the 
BIPM. These measurements are reported in detail 
in [1]. 

3.2 Corrections for Air Buoyancy 

In eq (1), the quantity p^ is typically determined 
from an equation-of-state for moist air. The inputs 
to this equation are temperature, barometric pres- 
sure, relative humidity, and ambient level of carbon 
dioxide. The last of these has relatively little effect. 
It is obvious that errors in measuring the required 
experimental input parameters will propagate to 
the final result. In the 1970s, however, it was ap- 
preciated that the equation-of-state itself has great 
importance and that several such equations were in 
wide use. Furthermore, it had not yet been demon- 
strated experimentally that any of the equations-of- 
state in use were adequate for actual mass 
comparisons. 

At NIST, Jones derived a semi-empirical equa- 
tion-of-state based on up-to-date data [10]. This 
equation, with minor changes, was endorsed for 
use in mass metrology by the CIPM in 1981 [5]. 
The equation given in [5] is now referred to as the 
CIPM-81 equation-of-state for moist air and is used 
for mass metrology by most national laboratories. 
The NIST began using this equation for interna- 
tional work in 1981. Use of CIPM-81 instead of its 
predecessor [11] makes a negligible change to rou- 
tine mass calibrations. As of January 1, 1990, how- 
ever, CIPM-81 has been adopted for use in all 
calibration software. 

In order to test the efficacy of CIPM-81, it is 
necessary to determine the mass difference be- 
tween two nominally equal weights with and with- 
out reliance on the equation-of-state. The latter 
measurement is typically done in vacuum. This 
type of comparison was done at the Physikalisch- 
Technische Bundesanstalt (PTB) [12]. Results 
agreed to within the expected uncertainty, 1 X lO""* 
in p.. 

It is also necessary to measure the input parame- 
ters with sufficient accuracy. In general, this re- 
quires the use of transducers whose calibration is 
checked at frequent intervals by defining instru- 
ments. Our capabilities as they existed in 1985 are 
described in [1]. Since that time, we have improved 
the accuracy of our measurements of barometric 
pressure and of relative humidity. 
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3.3 Improved Balance 

The balance used for primary mass metrology 
must operate by remote control in order to ensure 
that the weights being compared remain in suffi- 
cient equilibrium with the air of the weighing 
chamber. Schoonover and Keller have demon- 
strated that severe systematic errors may intrude if 
the equilibrium constraints are violated [13]. In ad- 
dition, the balance itself must have sufficiently high 
precision. We consider the balance to be suitable 
when either of the two following conditions is met: 
1. The contribution of the balance imprecision to 
the uncertainty of working standards is negligible 
compared to the imprecision of routine mass cali- 
brations. 2. The imprecision of the balance is negli- 
gible compared to typical instabilities of mass 
standards. 

In [1], we described modifications made to an 
existing balance which allowed it to fulfill the first 
criterion. Although working reasonably well, we 
wanted to improve efficiency by fully automating 
it. In order to make the job of automation more 
straight forward, the balance was fitted with an 
electro-magnetic servocontrol system [14]. Intro- 
duction of the servocontrol also resulted in a mod- 
est improvement in precision [15]. 

3.4 Stability of Mass Values 

It remains to demonstrate that the work under- 
taken since 1980 has led to an improved representa- 
tion of the SI unit of mass. 

3.4.1 K20 and K4 The most recent mass value 
for kilogram K20 results from the 1984 calibration 
at the BIPM [1]. As discussed in [1], the cleaning 
process at the BIPM removed significant amounts 
of surface pollution from the two prototypes. (The 
kilograms had also been cleaned at NIST but by a 
less effective technique). Since 1984, NIST has 
adopted the BIPM cleaning method. Values ob- 
tained for the difference in mass between K20 and 
K4 are shown in figure 3. These have standard de- 
viation of 0.0019 mg. We would expect a standard 
deviation of 0.0013 mg based solely on the ob- 
served standard deviation of the balance which 
was used. The difference is negligible. 

3.4.2 Ni and N2 Throughout the last 10 years, 
Ni and N2 continued to be used as working stan- 
dards for routine mass calibrations. In 1982, they 
were measured against K20 and K4 prior to send- 
ing the latter two weights to BIPM for recalibra- 
tion. The results, calculated after receiving the new 
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Figure 3, Mass values of K20— K4 as a function of time. Mea- 
surements were taken on a balance having a standard deviation 
of 0.0018 mg for a single reading. 

BIPM certificate, indicate that the value of R/2 
was 0.103 mg±0.025 mg below that accepted. The 
uncertainty is at an estimated level of one standard 
deviation and is dominated by problems with auxil- 
iary equipment used in measuring air buoyancy. 
The value of C was found to be —19.474 
mg±0.003 mg, consistent with the control chart 
data shown in figure 2. 

From 1986 to 1988, mass values of Ni and N2 
were determined three times against K20 and K4 in 
a more careful series of measurements. Several 
other stainless-steel kilograms were also involved 
in the measurements. These are discussed in section 
3.4.3, below. It is sufficient to mention at this point 
that this series of measurements was consistent 
with the long-term measurements of the other kilo- 
grams involved. The results of the 1986-1988 mea- 
surements are summarized in table 1. The 
uncertainty types and the rules for combining un- 
certainty conform to recommendations of the 
CIPM [3]. (This reference defines Type A and 
Type B uncertainties.) Components 2, 4, and 5 will 
be discussed in more detail in section 3.4.3. In as- 
sessing whether the observed change in R/2 after 
1986 is significant, one must not include Type B 
components, which we believe to be systematic to 
all measurements in table 1. It is interesting to note 
that the observed change in R/2 after 1986 is three 
times greater than the change in C. It is also inter- 
esting that the data of figure 1 show a statistically 
significant variation with time. A linear fit to the 
data predicts that the value of C in April 1988 was 
— 19.454+0.0028 mg (1 standard deviation), in sat- 
isfactory agreement with the measurement shown 
in table 1. 
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Table 1. Recent determinations of the masses of kilograms Ni 
and Nj with respect to secondary standards calibrated against 
K20. The values of C are subject to a measurement uncertainty 
of 0.0013 mg (1 standard deviation) 

R/1 C 

Date [(N, + N2)/2] [N.-Nj] 

1986 Aug 1 kg -5.159 mg - 19.440 mg 

1987 Nov 1 kg -5.192 mg -19.451 mg 

1988 Apr 1 kg -5.193 mg -19.447 mg 

Accepted values: 1 kg -5.0295 mg - 19.476 mg (1965) 

- 19.454 mg (Apr. 1988) 

Uncertainty (1 standard deviation or 68 percent confidence 
level) fbr measured values of /J/2 



Component 



Type A Type B 



1. Instability of K20 since 

1985 BIPM calibration included in 4 nil 

2. Calibration precision 

of secondary standard 0.0035 mg nil 

3. Correction of secondary 

standards for air buoyancy included in 2 0.01 mg 

4. Instability of 

secondary standards 0.0036 mg nil 

5. Calibration precision of 

(Ni + N2)/2 0.0005 mg nil 

6. Correction of Ni and Nj 

for air buoyancy nil 0.001 mg 

RSS 0.0050 mg 0.010 mg 

Combined Type A and B: 0.011 mg 



During this period, several kilograms which 
were submitted to NIST for calibration were mea- 
sured against Nj and N2 using routine calibration 
procedures. The test kilograms were also measured 
against stainless-steel kilograms which are dis- 
cussed in the next section using our best 1-kg bal- 
ance. The results were, in all cases, consistent with 
table 1. 

There was now good evidence that the accepted 
value of R/2 was 0.164 mg below the accepted 
value. Less certain evidence suggests that more 
than half of this difference had been present since 
at least 1969 (see sec. 2.2.1). This computes to an 
average change of order —0.004 ppm/yr. 

The standards Ni and N2 were again checked in 
1989. Although these measurements were not as 
extensive, they show that the average mass had 



dropped by another 0.05±0.013 mg (1 standard de- 
viation) with respect to four stainless-steel kilo- 
grams reserved for special use. This change thus 
appears to be real and serves as a warning that Nj 
and N2 are now losing mass at a greatly increased 
rate. The value of C measured during these mea- 
surements had returned to within 0.012 mg of the 
accepted value. 

3.4.3 New Secondary and Working Standards of 
Mass Kilograms Ni and N2 have served as both 
secondary standards — artifacts of practical density 
which most accurately represent mass as specified 
in the SI; and working standards — artifacts of prac- 
tical density used as standards in routine calibration 
work. Our intention was to separate these roles by 
acquisition of new standards, all made of non-mag- 
netic stainless steel. The choice of alloy simply re- 
flects the fact that the highest quality 1-kg weights 
which are commercially available are now made of 
stainless steel. Several stainless-steel kilograms 
were already on hand for use as secondardy stan- 
dards. Three of these, designated D2, El, and E2 
are about 25 years old. The physical characteristics 
of all three kilograms are similar; D2 was described 
in some detail in [1]. We also made use of a newer 
kilogram, designated CH-1, whose characteristics 
are also described in [1]. The four artifacts were 
grouped in pairs: CH-1 and D2 formed one pair 
while El and E2 formed the second pair. When not 
in use, the pairs were stored in separate containers 
of different design. The pair El, E2 was never sub- 
jected to any type of cleaning except for gentle 
dusting with a soft brush. The pair CH-1, D2 was 
cleaned on various occasions. 

The pair CH-1, D2 was compared eight times 
against primary standards K20 and K4. The mass 
values of CH-1 resulting from these measurements 
are shown in figure 4(a). Figure 4(b) shows mea- 
surements of the mass difference between CH-1 
and D2. Note that results displayed in figure 4(a) 
include a buoyancy correction of approximately 95 
mg while the correction for air buoyancy needed 
for the results in figure 4(b) was less than 3 mg. 
Figure 5 shows similar data for the pair El, E2. In 
this case, however, the pair CH-1, D2 was used as 
the standard. The mass value assigned to the stan- 
dard was the same for all the data shown. Pertinent 
statistical parameters are summarized in table 2. 
The outlying point in the mass difference of CH-1 
and D2 was repeatable. Because the difference re- 
turned to its previous values upon recleaning the 
two kilograms, we assume the outlying value was 
due to some type of surface contamination. At any 



86 



Volume 95, Number 1, January-February 1990 

Journal of Research of the National Institute of Standards and Technology 

Table 2. Statistical parameters inferred from measurements of secondary standards 



Mass of; 
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DF 


Sv/ 


Sb 


DF 


CH-1 


0.0085 mg 
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0.0013 mg 
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0.0052 mg 
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0.0013 mg 


0.0036 mg 


16.9 
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2.5 


E1-E2 
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Figure 4. (a) Mass of CH-1 as a function of time. These values 
are based on direct comparison with K20. The balance used has 
a standard deviation of 0.0018 mg for a single reading, (b) Mass 
of CH-1 — D2 as a function of time using the same balance as in 
(a). There is one outlying point which has been excluded in the 
data analysis. 

rate, the outlying point is not included in the calcu- 
lations for table 2. 

In table 2, 5,otai is the estimated standard deviation 
of the data shown in figures 4 and 5. The number of 
degrees of freedom in this estimate is given in the 
next column. The quantity s„ refers to the "within- 
group" standard deviation — that component of the 
observed standard deviation which can be at- 
tributed to the balance precision. This number is 
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Figure 5. (a) Mass of El as a function of time. These values are 
based on direct comparison with CH-1 and D2. The balance 
used has a standard deviation of 0.0018 mg for a single reading, 
(b) Mass of El — E2 as a function of time using the same balance 
as in (a). 

pooled from a great many measurements and thus 
has a large number of degrees of freedom. The "be- 
tween-group" standard deviation, 5b, is a measure 
of increased variability seen over long time peri- 
ods. This quantity is calculated from the others in 
the table. The estimated number of degrees of free- 
dom [16] in s^ is given in the last column. A full 
discussion of these parameters as well as their treat- 
ment in the context of mass calibrations has been 
given by Croarkin [17]. It is interesting to note that 



87 



Volume 95, Number 1, January-February 1990 

Journal of Research of the National Institute of Standards and Technology 



the data of figure 2, when subjected to the same 
analysis, indicate that s^, for these measurements is 
0.0116 mg [17]. 

The Croarkin model is not sufficient to model 
direct comparisons of CH-1 and D2 with K20 and 
K4. This is because uncertainties in buoyancy cor- 
rections have little effect on measured differences 
between weights of the same density but have large 
effects on measured differences between weights of 
different density. While the transducers used to 
measure the parameters of temperature, pressure, 
relative humidity, and carbon-dioxide level have 
excellent short-term precision, slow drifting be- 
tween recalibration leads to an additional "between 
group" uncertainty. If the error model of [17] is 
extended to include buoyancy effects, the data of 
table 2 can be used to compute an additional 
parameter Sp=0.007 mg (DF=3.9). This parameter 
characterizes daily variability in the measured mass 
difference between a kilogram of platinum-iridium 
and a kilogram of stainless steel due solely to mea- 
surement of the air buoyancy correction. 

Although based on somewhat limited data, it 
seems that El and E2, kilograms of the identical 
alloy and which are never cleaned, have a more 
stable mass than CH-1 and D2. This is a curious 
result in the sense that mass values for El and E2 
are based on direct comparison with CH-1 and D2. 
In these comparisons, it is assumed that the summa- 
tion mass of CH-1 and D2 is the average of all 
recent measurements which are in statistical con- 
trol. The evidence thus suggests that this average is 
a better estimate of the mass of CH-1 and D2 than, 
for instance, the most recently obtained values. 

As mentioned in the introduction to this section, 
it was envisioned that use of Ni and N2 as working 
standards would be superseded by stainless-steel 
kilograms. These would have a nominal density of 
8000 kg-m"l In 1985, six such kilograms, identical 
to CH-1, were obtained for this purpose. They are 
marked 1,2,. ..,6 but for purposes of discussion we 
shall refer to them as CI, C2,...,C6. Until January 
1988, these six kilograms were used extensively for 
various cleaning studies. Now, however, they will 
be used as working standards as described below in 
section 4. 



4. Summary of the Change on January 1, 
1990 

Beginning on January 1, 1990, the mass values 
assigned to working standards of the NIST calibra- 
tion service are based on a calibration chain which 



starts with mass values assigned to NIST primary 
standards K20 and K4 by the BIPM, continues 
with mass values assigned to secondary standards 
CH-1 and D2 with direct reference to K20 and K4, 
and finally to working standards CI, C2,...,C6 by 
direct reference to CH-1 and D2. 



4.1 Effect on Industry and Technology 

An Ad Hoc Committee of the National Confer- 
ence of Standards Laboratories (NCSL) was 
formed in order to help assess industrial and tech- 
nological implications of the actions contemplated 
for January 1, 1990. Members of the Committee 
include representatives from civilian and military 
standards laboratories, balance manufacturers, and 
weight manufacturers. All were asked to estimate 
the impact which a change of roughly 0. 1 5 mg/kg 
would have on their programs. The members could 
not identify a single instance where such a change 
would affect a manufactured product or a critical 
measurement. Virtually all concerned, however, 
recognized that a change of this magnitude could 
be noticeable within their metrology laboratory. 
This is not surprising since typical NIST calibra- 
tions give an uncertainty of about 0.075 mg (3 stan- 
dard deviations) for calibrations of 1-kg standards 
and users of these standards often have balances of 
comparable precision to our own. 

In recent years, calibrations for primary national 
laboratories of other countries have been carried 
out using secondary standards CH-1 and D2 with 
assigned values based directly on measurements 
against K20. These measurements are not, there- 
fore, in need of correction. 



4.2 Implementation 

Based on the data shown in section 3.4.2, it is 
clear that, by 1988, mass values assigned to NIST 
working standards were some 0.164 mg/kg higher 
than our best estimate of their actual value (that is, 
the value directly traceable to the representation of 
the SI unit of mass). At the beginning of the 
decade, the discrepancy was about 0.10 mg/kg. 
There is evidence that, between 1988 and 1989, the 
discrepancy grew still greater. 

In early 1988, and based on the data available to 
that point, it was decided to assign new mass values 
to NIST working standards on January 1, 1990. On 
the same date, the new quality-control procedures 
designed to keep mass values assigned to NIST 
working standards closely tied to the SI representa- 
tion of mass would be in place. Various standards 
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organizations were informed of these intentions by 
letter. The letter also stated that the new mass val- 
ues would be of the order of 0.15 ppm lower than 
the present values. Also in 1988, the NCSL Ad 
Hoc Committee was established to help in the im- 
plementation of the change. The target date of Jan- 
uary 1, 1990 was chosen to coincide with the date 
on which international changes in the representa- 
tions of the SI volt, ohm, and kelvin would be im- 
plemented. Guidelines developed by the Ad Hoc 
Committee are given in the Appendix. 

These guidelines treat the discrepancy between 
the accepted mass of NIST working standards and 
the mass traceable to SI representations as equal in 
magnitude to 0.17 mg/kg (0.17 ppm) throughout 
the decade from 1980 through 1989. Based on data 
presented above, we see that this is an oversimplifi- 
cation. Our best data, taken between 1986 and 
1988, give the discrepancy as 0.164 mg/kg. Less 
accurate data, however, suggest that the dis- 
crepancy grew slowly throughout the decade and 
then increased rapidly in the last year. A time-de- 
pendent correction algorithm with time-dependent 
uncertainty could, of course, be devised based on 
these data. The complexity of applying such an al- 
gorithm combined with its trivial scientific or tech- 
nological benefit made this course unwise. Instead 
we recommend correction of —0.17 mg/kg made 
to NIST calibration certificates dated during the 
1980s. This, we believe, will provide sufficient con- 
tinuity with certificates issued after January 1, 
1990. 

The BIPM is conducting the 3rd verification of 
national prototype kilograms. When this exercise is 
completed (perhaps in 2 years) we will have a 
much better idea of the internal stability of BIPM 
standards and the stability of these standards with 
respect to the national prototype kilograms. For 
the present, we estimate that the mass values used 
by NIST in its calibrations represent SI values as 
maintained by the BIPM to within 0.03 mg/kg or 
0.03 ppm (1 standard deviation). This uncertainty 
will not be included in NIST calibration reports 
except to say that it is systematic to all mass mea- 
surements. 



5. Future Plans 

We plan to participate in the 3rd verification of 
national prototype kilograms being organized by 
the BIPM. Consequently, in early 1990, we will 
send our national prototype (K20) to BIPM for a 
lengthy set of comparisons. 



We plan to recalibrate our working standards in 
terms of secondary standards CH-1 and D2 at ap- 
proximately 6-month intervals. The working stan- 
dards will not, initially, be cleaned although the 
secondary standards will. We foresee calibrating 
the secondary standards in terms of our primary 
standards K20 and K4 at about 2-year intervals. 
Based on the data presented above, we believe this 
procedure will permit us to know the mass ratio 
between our working standards and our primary 
standards to within 0.01 ppm (1 standard deviation) 
at all times. As noted at the end of the previous 
section, this uncertainty does not include possible 
discrepancies between NIST standards and those 
of the BIPM. We tentatively set the latter uncer- 
tainty at 0.03 ppm (1 standard deviation). 

It would be helpful to have a balance of 1-kg 
capacity and a standard deviation of order 0.005 
mg for use in routine calibration work. Such a 
device would help compensate for the fact that, 
since January 1, 1990, we are formally recognizing 
that our working standards are subject to uncer- 
tainty. 

A major goal of the new quality-control system 
is to improve international compatibility regarding 
practical mass standards. We are, therefore, seek- 
ing to promote international comparisons of stain- 
less-steel mass standards in order to ascertain the 
degree of compatibility among various industrial- 
ized countries. 

In conclusion, we note that a system of metrol- 
ogy ultimately based on an artifact standard will 
necessarily have shortcomings. Over a long 
enough period of time, mass differences between 
any two artifact standards will be unstable; the esti- 
mated standard deviation based on the complete 
data record will diverge. If the mass of one of the 
artifacts is arbitrarily assumed to be constant, its 
actual instability will in time be revealed by mea- 
surements of true physical constants. While there 
has as yet been no such revelation [18], modern 
technology may soon be expected to put the 
present definition of the SI kilogram to a severe 
test. 



6. Appendix. Notice of Change in the 
Unit of Mass Traceable to The Na- 
tional Institute of Standards and Tech- 
nology 

On January 1, 1990 the unit of mass as dissemi- 
nated by the National Institute of Standards and 
Technology (NIST) will shift by 0.17 mgAg (0.17 
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ppm). This small shift will bring the unit of mass 
traceable to NIST into better agreement with inter- 
national standards. Since the avoirdupois pound is 
defined as 0.45359237 kg, pound masses traceable 
to NIST will also be affected to the same extent 
(0.17/xlb/lb, or 0.1 7 ppm). 

Most people will be unaffected by this small 
change so that continued traceability to NIST can 
be maintained without taking any action. Unaf- 
fected users will be those whose mass standards are 
assigned an uncertainty greater than 1 mg/kg or 
1/Lilb/lb (1 ppm). Included in the mw affected list 
are: 

1. Analytical weights certified to be within any 
of the tolerances prescribed by NIST/NBS or 
ASTM/ANSI or to any OIML tolerance ex- 
cept E]. 

2. Direct-reading balances and scales. 

3. Any analytical weights which have been as- 
signed an uncertainty greater than 1 mg/kg or 
1 /xlb/lb (1 ppm). [This will typically include 
all weights greater than 2 kg or less than 20 g 
which were calibrated by NIST/NBS (see 
table Al). In some special cases, however, 
NIST calibrations at weight denominations 
other than those shown in table Al may have 
an uncertainty lower than 1 mg/kg.] 

Traceability to NIST of the above three cate- 
gories is unaffected by the change which will take 
effect on January 1, 1990. No action need be taken. 
In addition, any calibration certificate dated Janu- 
ary 1, 1990 or later already has any necessary 
changes incorporated. 

Table Al. Typically, action need be taken only for these nomi- 
nal values of weights and only if the assigned uncertainty is 
below the value given. Although this table shows weight de- 
nominations most likely to require correction, denominations 
which may require correction are not necessarily limited to 
those shown 

Nominal mass Uncertainty Nominal mass Uncertainty 



2 kg 
1kg 

500 g 
300 g 
200 g 
100 g 

50 g 
30 g 
20 g 



2.00 mg 
1.00 mg 

0.50 mg 
0.30 mg 
0.20 mg 
0.10 mg 

0.05 mg 
0.03 mg 
0.02 mg 



50 1b 
30 1b 
20 1b 
10 1b 

5 1b 
3 lb 
2 1b 
1 lb 

0.5 1b 
0.3 1b 
0.2 1b 



50;xlb 


30;xlb 


20;xlb 


lO^xlb 


5 ;xlb 


3 ;xlb 


2;xlb 


1 ^Ib 


0.5 ;xlb 


0.3 nib 


0.2 ^Ib 



Weights which will be affected by the change 
which will take effect on January 1, 1990 are all 
those which do not fall into category 3 above and, 
in addition, whose calibration certificate bears a 
date before January 1, 1990. Affected weights are 
those which have an assigned calibration uncer- 
tainty of less than 1 mg/kg (1 ppm). Based on typi- 
cal NIST calibration reports, these will generally 
be weights with denominations between 2 kg and 
20 g or 5 lb and 0.2 lb. Other denominations may be 
affected in special cases, however. 

The following actions will be necessary in order 
to maintain traceability to NIST for the affected 
weights: 

a. Weights whose calibration certificate bears a 
date after January 1, 1980 and before January 
1, 1990. 

After January 1, 1990 the mass of each 
affected weight should be reduced by 0.17 
mg/kg (0.17 ppm) as shown in table A2. This 
applies both to the true mass and the apparent 
mass. The uncertainty stated in the report re- 
mains the same. 

(Alternatively, the mass values stated in the 
calibration certificate may remain uncor- 
rected provided the stated uncertainty is in- 
creased by 0.17 mg/kg). 

b. Weight sets whose calibration certificate 
bears a date before January 1, 1980 but which 
have been subjected to a surveillance test 
within the 10 years preceding January 1, 
1990. (An example of a surveillance report 

Table A2. Corrections to apply to calibrations dated between 
January 1, 1980 and January 1, 1990. The denominations shown 
are those of table Al 



Nominal mass Correction 



Nominal mass 



Correction 



2 kg 
1kg 

500 g 
300 g 
200 g 
100 g 

50 g 
30 g 
20 g 



0.3400 mg 


50 1b 


-8.500 ^Ib 


0.1700 mg 


301b 


-5.100 ^Ib 




20 1b 


-3.400 ^Ib 


0.0850 mg 


10 1b 


- 1.700 ^Ib 


0.0510 mg 






0.0340 mg 


5 1b 


-0.850 ^Ib 


0.0170 mg 


3 lb 


-0.510 ^Ib 




2 1b 


-0.340 ^Ib 


0.0085 mg 


1 lb 


-0.170 ^Ib 


0.0051 mg 






0.0034 mg 


0.5 lb 


-0.085 ^Ib 




0.3 1b 


-0.051 ^Ib 




0.2 1b 


-0.034 ^Ib 
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issued by NIST is shown at the end of this 
Appendix. 

After January 1, 1990 the mass of each af- 
fected weight should be reduced by 0.17 mg/ 
kg (0.17 ppm) as shown in table A2. This 
applies both to the true mass and the apparent 
mass. The assertions of the surveillance report 
will remain in effect. 

(Alternatively, the mass values stated in the 
calibration certificate may remain uncor- 



rected provided the stated uncertainty is in- 
creased by 0.17 mg/kg). 

Weights whose calibration certificate bears a 
date before January 1, 1980 and which have 
had no surveillance test subsequent to January 
1, 1980. 

After January 1, 1990 the uncertainty as- 
signed to each affected weight should be in- 
creased by 0.17 mg/kg (0.17 ppm) until a new 
calibration or surveillance test is performed. 



Sample Surveillance Report Issued by NIST 

April 1, 1987 

In reply refer to: 731/12345 

Company XYZ 
1 Metrology Blvd. 
Grovers Corner, NJ 00000 
Attention: J. Doe 

Subject: Recalibration of Mass Standards previously calibrated under NBS Test No. 00/GOOOOO (copy 
attached) 

Items: Nine (9) Mass Standards: 100 g— 1 g 

The above items have been intercompared in sums. The differences as measured have been compared with 
the differences computed from the value under GOOOOO. One or more of the items have been checked against 
national standards. The results of this test indicate that there is no significant change since the last calibra- 
tion. This test assures the continuing accuracy of the values under GOOOOO. 

Sincerely, 



Richard S. Davis 

Group Leader, Mass Group 

Center for Manufacturing Engineering 

Attachment 
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